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Abstract — In random-access communication systems, the num- 
ber of active users varies with time, and has considerable bearing 
on receiver's performance. Thus, techniques aimed at identifying 
not only the information transmitted, but also that number, play 
a central role in those systems. An example of application of these 
techniques can be found in multiuser detection (MUD). In typical 
MUD analyses, receivers are based on the assumption that the 
number of active users is constant and known at the receiver, and 
coincides with the maximum number of users entitled to access 
the system. This assumption is often overly pessimistic, since 
many users might be inactive at any given time, and detection 
under the assumption of a number of users larger than the real 
one may impair performance. 

The main goal of this paper is to introduce a general approach 
to the problem of identifying active users and estimating their 
parameters and data in a random-access system where users are 
continuously entering and leaving the system. The tool whose use 
we advocate is Random-Set Theory: applying this, we derive op- 
timum receivers in an environment where the set of transmitters 
comprises an unknown number of elements. In addition, we can 
derive Bayesian-filter equations which describe the evolution with 
time of the a posteriori probability density of the unknown user 
parameters, and use this density to derive optimum detectors. In 
this paper we restrict ourselves to interferer identification and 
data detection, while in a companion paper we shall examine the 
more complex problem of estimating users' parameters. 

Index Terms — Multiuser detection, Random set theory, 
Bayesian recursions. 



I. Introduction 

In typical random-access communication systems, the num- 
ber of active users, their location, as well as the parameters 
that characterize their channel state, vary with time. Thus, 
techniques aimed at identifying not only the data transmitted, 
but also the user parameters, play a central role in analysis 
and design of those systems. Examples of application of these 
techniques can be found in multiuser detection (MUD), spatial 
multiplex schemes, and ad hoc networks. 

In MUD, it has long been recognized that one of the 
important issues is that the set of active users at any time 
may not be known to the receiver. For conventional (matched- 
filter) receivers, this dearth of information does not affect per- 
formance, while for other receivers the simplistic assumption 
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that all users are active will cause significant degradation. Q In 
addition, certain detectors based on interference cancellation 
must know the strongest active users in order to perform 
satisfactorily [7]. Moreover, as observed in [25], "identification 
of active users will help the system to promptly process 
requests and efficiently allocate channels. In such a way, 
system capacity can be increased." 

The problem of detecting active users in a multiuser system 
has been addressed by several authors in a CDMA context [4], 
[7], [16]— [18], [24]. Typically, the resulting multiuser receiver 
is a combination of two separate modules, namely, the active- 
user identifier and the multiuser detector. The treatment in [7], 
[16], [17] focuses on the problem of detecting a single user 
entering or leaving the system. Ref. [24] advocates a subspace- 
based method (MUSIC algorithm) for identifying the active 
users, under the assumption that the receiver knows the pool of 
all possible spreading codes that may be used in transmission. 
In [10], the active-user-identification algorithm is subspace- 
based, as in [24]; the receiver is not interested in decoding 
all active users, but only those transmitting a message to it. 
Ref. [25] addresses the problem of estimating the number of 
active users when synchronous and asynchronous users co- 
exist in the system. 

Further performance improvement can be expected if the 
receiver can exploit a priori information about users entering 
and exiting the system. The knowledge of a traffic model is 
exploited in [5], whose authors use it to improve the detection 
of active users. They model bursty traffic for an individual 
source as a two-state Markov chain. 

Other systems in which user identification is necessary 
include spatial multiplexing schemes, where the total system 
throughput can be optimized by properly selecting a subset 
of users to which the power is allocated [11], [26]. Thus, 
optimum power-control strategy requires the identification of 
this best set of users. In ad hoc networks, optimal transmission 
strategies require the identification and localization of active 
nodes in the neighborhood of the transmitter. 

A. In this paper. . . 

In this paper, and in its companion [2], we examine the 
general problem of detecting data and parameters of a set of 
users whose number is unknown. Unlike previous work done 
in this area, which advocates a two-stage receiver, we focus on 

'See, e.g., [5], [16], [24]. As an example, if a decorrelator detector [21, 
Chapter 5] does more nulling than needed, its performance is impaired. 
Ref. [8] describes a case where a multiuser detector suffers from catastrophic 
eiTor if a new user becomes active. 
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optimal design of a receiver which estimates simultaneously 
the number of users and their parameters and data. The 
receiver performance can be enhanced if one uses the a priori 
information offered by a traffic model, i.e., a dynamic model 
for the number of active users and for their parameters. In fact, 
information from the past history of the parameters may bring 
a considerable amount of extra information if their changes 
are not overly abrupt (for example, if the number of active 
users does not change considerably from frame to frame). 

Having to deal with random sets, i.e., with sets comprising 
a random number of random vectors (those including what 
is unknown about each user), a tool that can be used is 
Random Set Theory (RST: see the Appendix and the references 
therein). RST, recently applied in the context of multitarget 
tracking and identification (see, e.g., [6], [12]— [15], [22], [23]) 
is based on a probability theory of finite sets that exhibit 
randomness not only in each element, but also in the number 
of elements. RST (and in particular its formulation, referred 
to as Finite Set Statistics, or FISST [12]-[14], specifically 
tailored to problems in whose class lie those we are con- 
sidering in this paper) develops concepts which are not part 
of conventional probability theory. In fact, a central point 
in FISST is the generation of "densities" which are not the 
usual Radon-Nikodym derivatives of probability measures, but 
rather "set derivatives" of nonadditive "belief functions." On 
the other hand, these densities, which capture what is known 
about measurement state space, users state space, and users 
dynamics, can be derived in a rather straightforward way from 
the system model by using the FISST toolbox. RST is a tool 
that has considerable generality and flexibility, is consistent 
with engineering intuition, and is easy to useo 

To illustrate application of RST to random-access communi- 
cation, we focus on MUD problems, and derive Bayesian-filter 
equations describing the evolution with time of the a posteriori 
probability density of the unknown user parameters and data. 
Specifically, here we restrict ourselves to interferer identifica- 
tion and data detection, while in a companion paper [2] we 
examine the problem of estimating users' parameters as well. 
We hasten to claim that the applications considered here do 
not exhaust the potential of RST for the analysis of random- 
access system: thus, many of the simplifying assumptions are 
not made because more realistic models cannot be dealt with 
using our theory, but rather because we do not want to muddle 
the intrinsic simplicity of the RST tool with marginal details. 
This paper is organized as follows. Section [XT] describes the 
channel model, while Section [Til] states the MUD problem 
in the context of RST. Section [TV] describes an application 
to CDMA, while Section [V] provides some numerical results 
illustrating the theory. 



2 Advocating the use of RST to solve the problem at hand, we do not imply 
that it is the only tool that can be used. Actually, standard probability theory 
could be applied to achieve the same results, although, we argue, in a less 
elegant and concise way. The companion paper [2], which deals with estimates 
of random sets of continuous parameters, should be even more convincing 
about the usefulness of RST. 



II. Channel model and statement of the problem 

We assume K + 1 users transmitting digital data over 
a common random-access channel. Let s(xj°' 1 ) denote the 
signal transmitted by the reference user at discrete time t, 
t = 1,2,..., and s(X( I ' ) ), i = 1, . . . , K, the signals that may 
be transmitted at the same time by K interferers. Each signal 
has in it a number of known parameters, reflected by the 
deterministic function s( • ), and a number of random param- 

(i) 

eters, summarized by xj ; . The index i reflects the identity 
of the user, and is typically associated with its signature. The 
observed signal at time t is a sum of s(x[°' ) ), of the signals 
generated by the users active at time t, which are in a random 
number, and of stationary random noise z t . We write 

yi =.s(x[ 0) )+ Yl s (xt } )+zt. (1) 

where X t is a random set, encapsulating what is unknown 
about the active users. The notations of ([TJ implicitly assume 
that user is active with probability 1 and its parameters (but 
not its data) are known (this restriction can be easily removed). 

To motivate the development presented in this paper, and in 
particular our use of RST, we proceed to formulate the general 
problem through three intermediate steps. Specifically, we 
examine three scenarios of increasing complexity, under the 
assumptions that the users' parameters are all known, that the 
number of interferers is random and unknown to the receiver, 
and that we are interested in detecting the data transmitted by 
the reference user: 

© The receiver has no information about the a priori prob- 
abilities that the individual interferers are active. Two 
options we may consider here are maximum-likelihood 
(ML) detection of the reference user's data under the 
assumption that all potential interferers are active, or joint 
ML detection of the number of active interferers and of 
the reference user's data. Consider binary transmission 
for simplicity. In the first case, detection implies choosing 
among 2 x 2 K hypotheses. In the second case, the choice 
is among 2 x 3 K hypotheses, as every interferer may 
transmit one between two binary symbols, or be inactive. 
The difference in performance between the two situations 
is illustrated in Fig. [TJ which compares the two detectors 
described above. The ordinate shows the bit error proba- 
bility of the reference user in a multiuser system with 
2 independent interferers transmitting binary antipodal 
signals over an additive white Gaussian noise (AWGN) 
channel with the same a priori probability of activity 
a, spreading-sequences consisting of Kasami sequences 
with length 15 [9, p. 240], and perfect power control. The 
single-user bound is also shown as a reference. It is seen 
that RST yields a detector much more robust than classic 
MUD to variations in the users activity factor. We also 
observe that classic MUD can outperform RST for high 
values of a, as this situation corresponds to its having 
reliable side information about the number of active users. 

© The receiver knows the a priori probabilities that the 
individual interferers are active. System performance can 
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Fig. 1. Bit error probability of the reference user in a multiuser system 
with 2 interferers, independently active with probability a. Lines with diamond 
markers: Classic multiuser ML detection, assuming that all users are active. 
Lines with circle markers: ML detection of data and interferers number. Dashed 
curve: Single-user bound. 
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Fig. 2. Bit error probability of the reference user in a multiuser system 
with 2 interferers, independently active with probability a. Lines with diamond 
markers: ML detection of active users and data. Line with circle markers: MAP 
detection using a priori knowledge of the value of a. Dashed curve: Single-user 
bound. 

be further improved if the receiver is able to exploit addi- 
tional side information in the form of a priori probabilities 
of user activity. By assuming that the activity factor a is 
known, maximum a posteriori (MAP) detection yields the 
results shown in Fig. [2] 
© The receiver has a dynamic model of users ' activity. The 
receiver performance can be further improved by using 
additional information about the interferers, in the form of 
a model of their dynamic behavior. This information can 
be generated once a model of users' mobility is available. 
Observe again that the information carried by the interferers 
is contained in the set 

X t = {x« ...,xf>} 

whose elements are random vectors, and k is itself a random 
integer. RST develops a probability theory on random sets 
of this form, which are modeled as single entities. Roughly 
speaking, a random set is a map X between a sample space 
and a family of subsets of a space S. This is the space of 



the unknown data and parameters of the active interferers. For 
example, if everything about the interferers is known, except 
for their number and identity, then X takes values in the 
power set 2 K , where IK = {1, ... , K}. We may also consider 
a situation in which one or more parameters (the interferer 
power, etc.) are also unknown in addition to the interferers' 
number and identities, while the transmitted data are known 
(for example, in a training phase). In mathematical terms, § is 
generally a hybrid space E> = R d x U, with U a finite discrete 
set, and d the number of parameters to be estimated for each 
user. In the remainder of this paper we shall restrict ourselves 
to the case d = 0, and leave to a companion paper [2] the 
discussion of the case d ^ 0. 

With channel model (Q3, the receiver detects only a super- 
position of interfering signals. Thus, the random set describing 
the receiver, denoted Y t , is the singleton {yt}, where y t has 
conditional probability density function 

/ Yt |x t (yt|B) = / z (y t -a(B)) (2) 

where B = {bi,...,bfc} is a realization of X t , that is, a 
realization of a random set of users and their parameters/data. 
Moreover, / z ( • ) is the probability density function (pdf) of 
the additive noise, and 

a(B) 4 £ 8 (bi) (3) 

A. Defining estimators 

Development of estimators with our model must take into 
account the peculiarities of RST. For example, expectations 
cannot be defined, because there is no notion of set addition, 
and hence estimators based on a posteriori expectations do 
not exist (this point is discussed thoroughly and eloquently 
in [12]). A possible estimator maximizes the a posteriori 
probability (APP) of Xt given yi : x, the latter denoting the 
whole set of observations corresponding to a data frame 
transmitted from t = 1 to t = T, Another possibility is to 
restrict oneself to a causal estimator, which searches for the 
maximum probability of X t given yi :t . In a delay-constrained 
system, one may estimate X t on the basis of the observations 
y*-A:t+A, with A a fixed interval duration (sliding-window 
estimator). 

B. Consideration of a dynamic environment 

Since {Xt}^ forms a random set sequence, the statistical 
characterization of X t is needed for all discrete time instants 
t. If a dynamic model of the transmission system is available 
(which is what we assume in this paper), then the APPs can 
be updated recursively, thus allowing one to take advantage of 
the information gathered from the past evolution of the system. 
We observe in passing that the concept of an adaptive receiver 
was examined previously by several authors (see, e.g., [20] and 
references therein), while the effects on analysis of a dynamic 
model were touched upon, among others, by the authors of [3], 
[7], [16], [17]. 

We make the assumption that {Xf}^ forms a Markov set 
sequence, i.e., that X t depends on its past only through Xt-i. 
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This allows us to use Bayesian-filter recursions for X t [13]: 

/x t+1 |Y 1:t (B I yi:t) 

= J /x t+1 |x ( (B | C)/ Xf , Yl:f (C | y l!t )aC (4) 

/x t+1 |Y 1:f+1 (B I yi:t+i) 
oc / Yt+1 |x t+1 (y*+i | B)/ Xt+1 | Yl:t (B | yi :t ) (5) 

The integrals appearing in the equations are set integrals, 
defined in the sense of RST (see the Appendix). The notation 
5 for the differential reflects this definition. 

Thus, the causal maximum-a-posteriori estimate of X t is 
obtained by maximizing, over B, the APP /x t |Yi t (B | yi-.t), 
which is tantamount to minimizing 

m (B) — (yt — f(B)) 2 — e(B) 

where e(B) = N In /x t |Y 1:t _i (B | yi :f -i). The first term 
in the RHS of definition above is the Euclidean distance 
between the observation and the sum of the interfering signals 
at time t. Its minimization yields ML estimates of X t . The 
second term in the RHS, generated by the uppermost step of 
iterations, reflects the influence on X t of its past history, and 
its consideration yields MAP estimates. 

The Bayesian-filter recipe (@]i-(|5]l requires two ingredients. 
The first one is the channel model, through the pdf / z (yt | 
Xt). For example, assuming real signals, and the noise to be 
Gaussian with mean and known variance No/2, we have 

/ z (yt | Xt) ex exp{-(yt - a(Xt)) 2 /iVo} 

The second ingredient is the dynamic model of the random 
set sequence Xt, described by the function /x t+1 |x t ( ' I ' ) 
that describes the time evolution of data and parameters of the 
system. Examples of this modeling procedure are available for 
the problem of tracking multiple targets [13], [22]. 

From now on we restrict ourselves to the detection of the 
number and identity of active interferers, and of the data they 
carry, under the assumption that the remaining parameters, 
which were previously estimated by the receiver in a training 
phase, do not change in any appreciable way during the 
tracking phase. Estimation of these parameters using RST is 
described in the companion paper [2]. 



III. Detection 



A. Active users 



We assume first that we are only interested in detecting 
which interferers, out of a universe of K potential system 
users, are present at time t. This information may be used for 
example to do decorrelation detection, under the assumption 
that the signatures of all users are known at the receiver. In 
our theory, X t takes values in 2 K . Since this set is finite, a 
probability measure for X t can be defined by assigning all 
probabilities P(A), A g 2 K . 

1) Static model: At any fixed time t, suppose that the 
probability of interferer to be active is a, independent 
of t and i. In this case the probability of the interferer set Xf 
depends only on its cardinality |Xf|, and we can write 



To derive this result, we use RST by first computing the belief 
function 

/?x(S) 4 P(XCS) 
s 

= E E p ( x = B ) 

j=0 B:BCS & |B|=j 

and subsequently computing its set derivative, which, in the 
discrete case, becomes the Mobius inversion formula (f50b . 

2) Dynamic model: Consider now the evolution of X t . We 
assume that from t — 1 to t some new users become active, 
while some old users become inactive. We write 



Xt = S t U N t 



(8) 



where S t is the set of surviving users still active from t—1, and 
N f is the set of new users becoming active at t. The condition 
NfflXf-i = is forced, because a user ceasing transmission 
at time t — 1 cannot reenter the set of active users at time t. 
We proceed by constructing separate dynamic models for St 
and Nt, which will be eventually combined to yield a model 
for X t . 

Consider first St- Suppose that there are k active users at t— 



1, the elements of the random set X t _i = {xj_ 1; . 
Then we may write, for the set of surviving users, 



i=i 



(9) 



where Xj denotes either an empty set (if user i has become 
inactive) or the singleton {xj 1 ' 1 } (user i is still active). Let /i 
denote the "persistence" probability, i.e., the probability that 
a user survives from t — 1 to t. We obtain, for the conditional 
probability of St given that Xf_i = B: 



^(OIB)-!^ 1 -") 



BHC| ; CQB 
C £ B 



(10) 



For new users, a reasonable model has 



/njx^CIB) 



a |C| (1 _ a )X-|B H C| ; CnB 

o, cnB/' 



/ Xt (B)=al B l(l-a)*-l B l 



(6) 



(ID 

Finally, by assuming that births and deaths of users are 
conditionally independent given Xt-i = B, the conditional 
pdf of the union of the random sets St and N 4 is obtained 
from the generalized convolution [6] 

/x^x^CIB) 

= E /s t |x t _ 1 (W|B)/ Nt | Xt _ 1 (C\W|B) 
wcc 

= /s t |x ( _ I (CnB)/ N( | Xi _ I (C\(CnB)) (12) 
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3) B ay esian- filter recursions: In our context, recur- 
sions (0]i-([5]l can be implemented as follows. Determine first: 

© The a priori probability distribution of Xo at the be- 
ginning of the detection process. Description of this 
distribution consists of assigning probabilities to all the 
elements of 2 K . This can be done for example by assum- 
ing independent users with the same stationary activity 
factor. 

© The set of observations y t , t = 1, . . . , T. 
© The conditional pdf's /Y t |x t > depending on the channel 
model. 

© The "evolution" pdf's /x t+ i|x t > depending on the dy- 
namic model. 

The recursion goes as follows: omitting the subscripts for 
notational simplicity here, and identifying random sets with 
their realizations, we have 



f(X 1 ) = / /(Xx |Xo)/(X„)(SXo 

With this, we can compute 

/(Xi |yi)cx/(yi |Xi)/(X x ) 

which allows the calculation of the causal MAP estimate Xi. 
Next, we compute 

/(X a | yi) = J /(X a | X x ) /(Xi | yi ) 5X l 

and hence 

/(X 2 | yi.-a) oc /(y 2 |X 2 )/(X 2 | y x ) 

which allows the calculation of X 2 . The general recursion has, 
for t = 2, . . .: 

/(Xt+i | yi : t) = j /(X t+1 | X t ) f(X t | y 1:t ) SX t 

/(X t+ i | yi: t +i) oc f(y t +i | X f+ i)/(X t+ i | yi :t ) 

and, in the case examined in this section, 

/(X t +i | yi:t) 

= J2 /(Xt+i | X t ) /(X 4 | y 1:t ) (13) 

X t 62 K 

/(x t +i I yi.-t+i) 

oc / z (yt+i - tr(X t+ i)) /(Xt+i | y 1:t ) (14) 

B. Active users and their data 

Assume binary information data, independent from time to 
time and across users, and a discrete-time unit such that from 
t to t + 1 each user transmits N binary symbols. In this case 
Xt takes values in a set with 



K 

E 

k=0 



K 



(1 



elements, that we denote (1 + 2 N ) K . Eq. © becomes 
/x t (B) = 2~ N WaW(l-a) K -W 



where the new factor 2 _Ar l B accounts for the fact that there 
are iV|B| equally likely binary symbols transmitted at time t 
by |B| interferers. 

Similarly, ( TTOb is transformed into 



/ St | Xt _ 1 (C|B) 

2-^|cl M lc|( 1 _ /i )|B|-|C| ) CgB 

0, C^B 
and (fTTT > into 

(C | B) 

2- Ar l c lal c l(l-a)^-l B l-l c l, CHB = 
0, CnB^ 



(16) 



(17) 



C. Possible scenarios 

We recall that throughout this paper we assume that the only 
unknown signal quantities may be the identities of the users 
and their data. Specifically, we may distinguish four cases in 
our context: 

© Static channel, unknown identities, known data. This 
corresponds to a training phase intended at identifying 
users, and assumes that the user identities do not change 
during transmission. In this case we write X in lieu of 
Xt. 

© Static channel, unknown identities, unknown data. This 
may correspond to a tracking phase following © above. 
We write again X in lieu of X t , and assume that X 
contains the whole transmitted data sequence. 

© Dynamic channel, unknown identities, known data. This 
corresponds to identification of users preliminary to data 
detection (which, for example, may be based on decorre- 
lation). 

© Dynamic channel, unknown identities, unknown data. 
This corresponds to simultaneous user identification and 
data detection in a time-varying environment. 



IV. An example of application 

Assume now the specific situation of a DS-CDMA system 
with signature sequences of length L and additive white 
Gaussian noise. At discrete time t, we may write, for the 
sufficient statistics of the received signal, 



y t =RAb t (X t ) + Zi, 



t = 1, 



(18) 



(15) 



where X t is now the random set of all active users, R is the 
L x L correlation matrix of the signature sequences (assumed 
to have unit norm), A is the diagonal matrix of the users' 
signal amplitudes, the vector bf(Xt) has nonzero entries in the 
locations corresponding to the active-user identities described 
by the components of X t , and z t ~ N(0, (iVo/2)R) is the 
noise vector, with No/ 2 the power spectral density of the 
received noise. We further assume N = 1, i.e., that at every 
discrete time instant only one binary antipodal symbol is 
transmitted. 
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A. Static channel 

The a posteriori probability of X, given the whole received 
sequence (we omit the time subscript for simplicity), is 



/(X | yi, . . . ,y T ) 

/ x (X)/(yi,...,y T | X) 



(19) 



j-^ E (y* - RAbtCXJj'R- 1 (y t - RAb t (X))| 



= cxp 

x/x(X) 

Thus, the MAP estimator of users' identities is 

X = argmax/(X | yi :T ) 



(20) 



where, as usual, yi-y = yi, . . . ,yr- The MAP estimator of 
users' identities and data is 



X = argmax/(X | y 1:T ) 



(21) 



where the set of possible realizations of X includes (1 + 2 T ) K 
elements: in fact, in T time interval the number of transmitted 
binary symbols is 2' X I T , and 

K 



E 

|X|=0 



2 |X|T = (1 + 2 T ) 



K 



The expression above can be rewritten in such a way that 
the presence of the sequence of transmitted data is made more 
explicit. Specifically, we may write, in lieu of X, the sequence 
(X, bi(X), . . . ,bx(X)). Doing so, we may express the MAP 
estimator of users' identities and data in the more explicit form 



(X,b 1 (X),...,b T (X)) 
= argmax/(X,bi(X), 



,br(X)|y 1:T ) (22) 



where the maximum has to be taken with respect to X and 
bt(X), t = The introduction of this "fine-grain" 

notation for the random set suggests that the MAP detector 
may be implemented in the form of a sequential detector, thus 
simplifying its operation (more on this infra). 

Before moving on, it may be worth pointing out that, 
for the case considered here, the same results could be 
obtained through ordinary probability theory by introducing 
a if-dimensional vector sequence, taking on binary or ternary 
values according to whether a training sequence is used or not: 
as already anticipated, though, such equivalence holds only for 
the discrete case, and, unlike the RST formulation, does not 
suggest an immediate extension accounting for more general 
channel models [2]. 

B. Dynamic channel 

Consider now a dynamic channel, and examine first the case 
of known data. We have, accounting for the Markov property 
of our channel model, 

/(Xi,-,Xt |yi:r) (23) 

T 

/(y 1:T | Xi, . . . , X T ) x /(Xx) [] /(X t | X t _i) 

t=2 



OC 



with /(Xi) a density whose assignment is based upon prior 
knowledge of the channel state at the beginning of the trans- 
mission. The MAP estimator here maximizes the RHS of the 
above (or its logarithm) with respect to the values taken on by 
the sequence (Xi, . . . , Xp). Even in this case we may think 
of a sequential detector, which searches for the maximum- APP 
path traversing a trellis having T stages and a number of states 
at stage i equal to the number of realizations of the random 
set X^ 

a) Implementing a sequential detector.: Implementation 
of the sequential detector through a version of Viterbi algo- 
rithm leads to the following consequences: 

© The decision on the whole sequence of users' identities 
and their data should be taken only after the whole 
sequence of observations yi , . . . , yr has been recorded. 

© The decision on the users' identities and their data at time 
t depends not only on the past observations, but also on 
observations that have not been recorded yet at time t. 

© A suboptimum version of the optimum sequential algo- 
rithm, the sliding-window Viterbi algorithm (see, e.g., [1, 
p. 133 ff.]) can be implemented. This consists of forcing 
a decision on X t , b t (X f ) based on a sliding window of 
observations that includes y t , but whose length is smaller 
than T. 



C. PEP analysis 

We now evaluate the performance of the detectors described 
above. We assume N = 1 for simplicity, and derive bounds 
and approximations to error probabilities using the pairwise 
error probability (PEP) P(X t — ► Xt). This is the probability 
that, when X t is the true value of the random set to be 
detected, the receiver assigns a higher APP to X t ^ X t (see, 
e.g., [1, p. 43]) @ 

1 ) Static channel: Defining 



S t (X,X) 

we have 
and 



(y t -RAb t (X))(y t -RAb t (X))' 
St(X,X)=R- 1 z^ 



(24) 



y t =RAb f (X)+z f 



(25) 



S t (X, X) = R- 1 [ (RAd t (X, X) + z t ) (RAdt (X, X) + z t ) '] 

(26) 

where d t (X,X) = b t (X) — b t (X). Based on the above, the 
PEP with ML detection of unknown user identities can be 
written as 



P(X 



tr 



^S t (X,X)-S t (X,X) 



< 



(27) 



3 It might be worth observing here that, contrary to a fairly widespread 
misconception, P(Xt — * Xt) is not the probability of mistaking Xt for Xt, 
unless Xt and Xt are the only possible alternatives. 
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Now, observe that 
tr 



S t (X,X)-S t (X,X) 
= tr [R^ 1 (RAd t djAR + z ( d;AR + RAd t z^ 
= tr [Ad t d;AR + R- 1 z t d' t AR + Ad t z' t ] 
= tr [Ad t djAR + z t d' t A + Ad t z' t ] 
= tr [RAd t djA] + 2tr [Ad t z' t ] 



Denoting by a(i) the ith diagonal element of matrix A, by 
dt(i) the ith entry of d t , by rjk the entry in row j and column 
k of R, and by z t (i) the ith entry of vector z t , we have 

K K 

tr[RAd t d;A] = ]T^a(Oa(jX(*H(jK, (28) 

i=l j = l 
K 

tr[Ad t z' t ] = 5^o(i)dt(i)2t(0 (29) 
i=i 

Finally, since we are assuming z f ~ K(0, (iVo/2)R), we have 



tr 



^(s t (x,x)-s t (x,x; 



W(£ T ,27V £ T ) (30) 



where 



T K K 
t=l i=l 3=1 



In conclusion, we obtain 

P(X^ X) 




(31) 



(32) 



where Q(-) denotes the Gaussian tail function. 

Before proceeding further, we comment briefly on the 
structure of vectors d t . They have the following nonzero 
entries: 

© The |XnX| terms corresponding to users present in both 
sets: these terms may take on values in {0, ±2}. 

© The |X \ Xn X| terms corresponding to users present in 
X only: these terms may take on values in {±1}. 

© The |X \ Xn X| terms corresponding to users present in 
X only: these terms may take on values in {±1}. 

Similarly, the PEP with MAP detection has the form 



where 



P(X -> X) = Q 



7] = N Q In 



/x(X 



(33) 



_/x(X)_ 

Observe here that, with R a positive definite matrix, we have 
(Ad t )'R(Ad t ) > for Ad t ^ 

which entails 



lim P(X -> X) 

T— >oo 







Notice also that, for given signal-to-noise ratio, d33l also 
suggests a minimum length of the data frame in the form of 
the "open-eye" condition T > T m i n , where 

T min ==inf |t: min[£r(X,X) - »?(X,X)] > o| 

The PEP for the case of detection of user identities and 
data can be dealt with with similar techniques, and we shall 
not delve in this issue any further here. 

2) Dynamic channel: In this case, defining the true state 
sequence X = (Xj)^_ 1 and the competing state sequence 
X = (%)f =1 , we obtain the PEP for the MAP detection of 
unknown identities: 



f(X -► X) 



Ct(X,X)-^t(X,X) 



2iV f T (X,X) 



(34) 



and the PEP for the MAP detection of unknown identities and 
data: 

p(x, (b t (x t ))Li-x, (b t (x f ))Li) 

fr(X J X)-ifr(X ) X) 



Q 



where 

fr(X,X) 

?7t(X,X) 
»7r(X,X) 



2^Vo^t(X,X) 

d^ARAdt 

/x t |x t _! (X t | X t _i] 



t=i 



t=i 

= ??t(X)X) 



/x t |x t _i(Xt | X t _i, 



IX, 



In 2 



(35) 

(36) 
(37) 

(38) 



Similar arguments, which we omit here for brevity's sake, 
apply to ML detection. 

D. Error probabilities 

Several approximations to error probabilities are possible, 
based on the union bound (see, e.g., [1]), on the PEP deriva- 
tions outlined supra, and on assumptions on user statistics, 
spreading codes, and users' amplitudes. We obtain, for the 
union bound to the probability of mistaking the set of active 
users, 

2 K 

P(e) < ]T /(X,) ]T P(X, - X,) (39) 

i=l j^ti 

which, under maximum prior uncertainty as to the channel 
occupancy, becomes: 



(40) 



i=l jyti 



where Xj,Xj € 2 K . This union bound can be simplified by 
restricting the inner summation to those pairs of realizations of 
the random sets that are most likely to contribute significantly 



g 



to error probability. For example, if we restrict it to the pairs 
that differ in at most n entries, we obtain an approximation 
depending on n: 

2 K 

P( e )«pM(e)A_L£ J2 PfXi-Xj) (41) 

i— 1 j:Si j <n 

where 

s itj 4 |Xi \ x, n Xj | + \Xj \ x< nXj-| < n 

Likewise, for a dynamic scenario, we have that the union 
bound for user identification is written as: 

P(e)< /QO X^Qt-X) < 42 > 

where /(X) can be easily determined by applying the chain 
rule to the Markov set sequence X- Approximations similar 
to ( l41~b can be developed; likewise, the case of joint user 
identification and data detection can be handled by noticing 
that the configurations of X become now 3 KT , and the joint 
density /(X) is written as: 

T 

/(X) = 2-l Xl l/(X 1 )J]2-l x 'l/(X t | X t _x) (43) 

t=2 

The above relationships also suggest a semi-analytical 
method to evaluate the approximations without summing up 
an exponential number of terms: indeed, since an average over 
the joint density d43l is to be performed, this can be efficiently 
evaluated through Monte-Carlo counting by generating a sub- 
stantially smaller number of independent set sequence patterns 
obeying the Markov law d43l . 

V. Numerical results 

In this section we show some numerical examples that 
illustrate the theory developed before. 

Fig. |3] shows how the knowledge of the channel dynamics 
can improve the performance of a multiuser detector. 

Fig. E] refers again to a static channel and to the case that 
the active users transmit a known sequence of bits in order 
to be identified: we assume here that all users (including the 
reference user) are active with probability a = 0.5. Now K = 
6, the transmitted signals are binary antipodal, spreading is 
done through m-sequences with length 7, the power control 
is perfect (hence, A is a scalar matrix) and the data-frame 
length varies from T = 1 to T = 3. Here we evaluate the 
accuracy of the union bound to the probability of an error in 
the identification of active users (set-error probability, or SEP), 
and of its approximation P^'(e) (obtained by assuming that 
the errors can only be generated by the event, denoted £(1), of 
mistaking an active-user set by another differing by only one of 
its elements). Simulation results are also shown for reference's 
sake. It can be observed how, especially for large values of 
signal-to-noise ratio, the error probability is dominated by the 
event £(1). 

Fig |5] refers to a system with the same configuration as in 
Fig. IU but on a dynamic channel with K = 3, a = 0.2 
and fj, = 0.8. The system dynamics are tracked over an 




£ io' 4 



3 4 5 6 7 8 9 

E b /N Q (dB) 

Fig. 3. Bit error probability of the reference user in a multiuser system with 2 
interferers, following a dynamic model described above with a = 2 and u, = 
0.8. Line with diamond markers: Classic multiuser ML detection, assuming 
that all users are active. Line with circle markers: MAP detection based on 
the knowledge of a alone. Line with star markers: causal RST detector, based 
on Bayes recursions. Line with square markers: Viterbi RST detector. Dashed 
curve: Single-user bound. 




2 4 6 8 10 12 U 

E b /N Q [dBJ 



Fig. 4. Set-error probability with ML detection based on RST with K = 
6, L = 7. Comparison among "exact" probability ( obta ined by simulation), 
union-bound to it (denoted P(e) ), and approximation (41t to the union bound. 

interval with length T = 10. The ordinate shows the set 
error probability (SEP), i.e., the probability of an erroneous 
estimate of the active-user sequence. Here a comparison is 
made between a non-causal Viterbi set estimate and a causal 
estimate, obtained through Bayesian-filter recursions. To elicit 
the impact of the causality constraint, we represent the SEP for 
the set Xi, where the causality constraint prevents sequence 
detection, and for the set Xio, where such a constraint has no 
effect: as expected, the performances of the Viterbi algorithm 
and of the Bayesian recursions coincide when estimating Xio, 
while the causality constraint has a perceivable effect on the 
performance when Xi is estimated. 

In order to provide global figures of merit of both trained 
and untrained systems in a dynamic environment, we use the 
"Set Sequence Error Probability"(SSEP). For trained systems, 
this is the probability that for some t, 1 < t < T, the estimated 
set X t differs from the true set X t either in its cardinality or in 
its elements. For untrained systems, it is the probability that 
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Dynamic Channel, u=0.8, rc=0.2, K=3, T=10, L=7 




10 ' 1 1 1 1 1 ' 

2 4 6 8 10 12 14 

E b /N Q [dBJ 

Fig. 5. Trained acquisition of the set of active users through the Viterbi 
Algorithm and Bayes recursion: effect of the causality constraint 

at some t the estimated and the true set differ either in the 
cardinality and/or in the identities of the active users and/or 
in the transmitted data. Plots of the SSEP are shown in Fig. [6] 
for a trained system with K = 6 maximum number of active 
users: also shown in the figure is the curve obtained through 
the semi-analytical approximation suggested in the previous 
section, which apparently follows the numerical results quite 
closely. 




2 4 6 8 10 12 14 

E b /N Q [dB) 

Fig. 6. Set sequence error probability with K = 6, L = 7. Also, the 
curve corresponding to a semi-analytical performance evaluation under trained 
acquisition of the set of the active users. 

The case that not only the identities, but also the data of the 
active users are to be estimated is shown in Fig. [7] assuming 
a maximum of K = 3 active users and, again, L = 7; the 
data-frame length is T = 10. Here we compare a Viterbi- 
algorithm receiver with one based on Bayesian recursions for 
estimating the set of interferers and the transmitted bits. The 
ordinate shows the bit-sequence error probability, at time t = 
1 and at time t = T = 10, defined as the probability that 
the estimated and the true set do not coincide: the term "bit 
sequence error probability" is tied here to the fact that an error 
in estimating the identities of the active users automatically 
implies an error in estimating the stream b(X t ), while the 
converse is not true. Once again, the effect of the causality 



constraint on the performance is elicited, and the results are 
in accordance with the intuition as well as with the curves of 
Fig. 13 



Dynamic Channel K=3, T=1 0, L=7, u=0.8,ct=0.2 




2 4 6 6 10 12 14 

E/N [dB] 



Fig. 7. Bit-sequence error probability of the reference user. Data estimated 
with Viterbi algorithm and Bayesian recursions. 



VI. Conclusions 

We have described a technique for estimating the received- 
signal parameters and data in a multiuser transmission system. 
Since the number of active interferers is itself a random 
variable, the set of parameters to be estimated has a random 
number of random elements. A dynamic model for the evolu- 
tion of this random set accounts for new interferers appearing 
and old interferers disappearing in each measurement inter- 
val. Random-set theory can be used to develop a multiuser 
detection scheme in this context. This is done by developing 
Bayesian-filtering equations that describe the evolution of the 
MAP multiuser detector in a dynamic environment. 

One question that may naturally arise at this point concerns 
the need for RST, and in particular the question if the 
results obtained with RST can also be obtained in a more 
straightforward way. Although other ways of obtaining the 
same results with conventional probability techniques may 
be available (see,e.g., [23], where the connections between 
RST and point processes are explored), we argue that not 
only the rigor and the generality of RST, but especially its 
simplicity, make it a tool of choice for the study of random- 
access systems. The results of this paper are also meant to 
support this claim. 

Appendix 

This appendix describes, in a rather qualitative fashion, the 
fundamentals of Random-Set Theory. For a rigorous approach 
and additional details, see [6], [22], [23] and the references 
therein. 

Given a sample space fl (the space of all the outcomes of 
a random experiment), a probability measure can be defined 
on it. If a random variable (i.e., a mapping from SI to another 
space §) is defined, it is convenient to generate a probability 
measure directly on S. This can be given in terms of a 
density function, once certain mathematical operations, such 
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as integration, are defined on 8. Random sets can be viewed as 
a generalization of the concept of a random variable. A finite 
random set is a mapping X : — * ?(§) from the sample 
space il to the collection of closed sets of the space §, with 
|X(oj)| < oo for all ui £ f2. For our purposes, the space 
§ of finite random sets is assumed to be the hybrid space 
E> = R d x U, the direct product of the e?-dimensional Euclidean 
space R d and a finite discrete space U. The elements of § 
characterize the users' parameters, some of which continuous 
{d real numbers) and some discrete (for example, the users' 
signatures and their transmitted data). An element of § is the 
pair (v, u), v a <i-dimensional real vector, and u E U. 

A. Belief mass functions 

A fairly natural probability law for X is the probability 
distribution fx, defined for any (Borel) subset T of ?(§) by 

P X (T) =P(XeT) 

However, RST is based on a probability law given differently. 
Specifically, the belief mass function of a finite random set X 
is defined as 



/3x(C) 4 P(X C C) 



(44) 



where C is a closed subset of §. As observed in [22, p. 42], 
the belief function corresponds to the cumulative distribution 
function of a real random variable, and differs from it because 
subsets are only partially ordered by the inclusion relation C. 
The belief function characterizes the probability distribution 
of a random finite set X, and allows the construction of a 
density function of X through the definition of a set integral 
and a set derivative. Specifically, the belief density, i.e., the set 
derivative of the belief function, plays the role of a probability 
density function in ordinary probability calculus (for this 
reason, in this paper we refer to it simply as a density). The 
belief density is obtained as 

5/3(3) 



/x(Z) = 



sz 



(45) 



where S denotes the set derivative, to be defined below. As 
observed in [6, p. 163], the value /?x(Z) of the belief density 
specifies the likelihood with which the random set X takes 
the set Z as a specific realization. 

Notice how, in the special case of a random set consisting 
of a singleton, X = {x}, x a random vector, we have 

/?x(C) = P(X CC) = P(x e C) = P(C) 

with P( ■ ) the ordinary probability measure of x. 



B. Set derivative 

Let C(S) denote the collection of closed subsets of S. If F 
is a set function defined on S(S), then its set derivative at x 
is defined as the set function 

6F{S) = lim lim lM\ B '0-/3))\jB;0-/i)]-F(S) 



Sx 



m(B x (i/i)) 



where B x (l/j), B x (l/i) are an open ball of radius 1/j and a 
closed ball of radius respectively, both centered at x, and 



m(-) denotes the hybrid Lebesgue measure, i.e., the product 
of the ordinary measure in M. d and of the counting measure. 
The set derivative of F at a finite set X = {xi, . . . ,x n } is 
defined by the recursion 

SF(S) A S / SF(S) 
5X Sxn \<5{xi, . . . ,x„_i} 

In particular, the belief density of the random set X is given 
by 

Spx(9) 



/x(B) 



SB 



(46) 



Two useful rules of derivation are the following (see 
also [15, p. 386 ff.]). Let F, G be set functions, and a,del. 
Then 



6(aF(S) + 6G(S)) _ JF(S) + & <SG(S) 



and 



SB 

SF(S)G{S) 
SB 



E 

CCB 



SB SB 

SF(S) SG{S) 
SB S(B\C) 



(47) 



(48) 



C. Set integral 
Let / denote a set function defined by 

6F(S) 



/(X) 



8X 



S=0 



The set integral of / over the closed subset S C § is given by 
f(X)5X (49) 

= /({0» + E f sk ^ Xl ' ■ • • ' Xk})drh(*i) ■ ■ ■ dfh{* k ) 



fe=i 



where /({xi, . . . , x^}) = if Xx,...,Xfc are not distinct. 
Since we are dealing with finite random sets, the summation 
above contains only a finite number of terms. 



D. Special case: d = 

The special case d = (which corresponds to making § 
a discrete finite set) reduces the set derivative to the Mobius 
inversion formula [22, p. 43]: 



/x(A) = J2 (-l) ,MB| /3x(B) 



(50) 



BCA 



and the set integral to 



1 

/(W)+EfeT E (5D 

fe=l x 1 ^£...^j£x k 

where the summation is extended to all possible combinations 
of k distinct elements x& € S (in this case, the hybrid 
Lebesgue measure reduces to the counting measure, and hence 
the Lebesgue integrals in d49l become summations). 
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E. Generalized fundamental theorem of calculus 

Set derivatives and set integrals turn out to be the inverse 
of each other: we have 



/(X) 



SF(S) 



5X 



F(S) 



S=0 



/(X) SX (52) 



By using the above result, belief functions and belief densities 
can be derived from one another. 
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